IN THE CLAIMS . 

Please substitute the following claims for the same numbered claims in the appl 



ication. 



Claim 1 (Currently Amended): A method for evaluating and outputting a final clustering 
solution for a plurality of mum-dimensional data records, said data records having 
multiple, heterogeneous feature spaces represented by feature vectors, said method 

comprising: 

defining a distortion between two feature vectors as a weighted sum of distortion 
measures on components of said feature vector vectors : 

clustering said multi-dimensional data records into k-clusters using a We* 
progidimning" convex nmpn-m,^ formulation; and 

selecting feature weights of said feature vectorsr,and 
minimizinp dktnrtion of sa ,Vi ^..^p. 

Cain, 2 (Currently Amended): The method according to cIaim „ ^ 

of feature weights are is optimized by an p^We funcion ,„ prodllce ^ 

-W-f «na, catering ^ tha, simultaneously miniraizes average 

dispersion and maximizes average inter-cluster dispersion along all said fcejersge^ 

feature spaces. 

Claim 3 (Original,, The method according to claim ,. wherein said clustering inc.udes 
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initially applying a local minima of said clustering. 



Claim 4 (Original): The method of claim 1 , wherein said clustering comprises a k-means 
clustering algorithm. 

Claim 5 (Currently Amended): The method of claim 2, wherein said minimizing 
distortion of individual clusters includes taking said data records and iteratively 
determining Voronol partitions until said «objeerm« objective function, between two 
successive iterations, is less than a specified threshold. 

Claim 6 (Original): The method of claim 1 , wherein said clustering comprises arming 
word data, and said feature vectors comprise multiple-word frequencies of said data 

records. 

Claim 7 (Currently Amended): The method of Cairn I, wherein said clustering comprises 
analyzing data records having numerica. and categorical attributes, and said feature 
vectors comprise linearly-scaled numerica, attributes »*«H Wo « a „ 
using a 1-in-q np^uHaliun of j , aid data icc u ids . 

Claim 8 (Current Amended): A method for evaluating and outputting a clustering 
solution for a plurality of multidimensional data records, said data records having 
multiple, heterogeneous feature spaces represented by feature vectors, said method 
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comprising: 

defining a distortion between two said feature vectors as a weighted sum of 
distortion measures on components of said feature vector vectors : 

clustering said multi-dimensional data records into k-clusters using a "convex 

_ W convex propramminc formulation of a generalized k-means clustering 

function; and 

selecting optimal feature weights of said feature vectors by an "objective" 
objective function to produce said solution of a final clustering that simultaneously 
minimizes average intra-cluster dispersion and maximizes average inter-cluster dispersion 
along all said feature spaces. 

Claim 9 (Original): The method of claim 8, wherein said clustering includes i„ itia „ y 
applying a local minima of said clustering. 

Claim ,0 (Current., Amended): The method of claim 8, whe rei „ said minimizing 
distortion of individual clusters includes taking said data records and i,e ra «ive.y 
determining Vorono, partitions until said •Wjeetire- ab^e function, between two 
successive iterations, is less than a specified threshold. 

Cairn 1 1 (Original): The method of Cairn 8, wherein said clustering comprises analyzing 
word data, and said feature vectors comprise muhiple-word frequencies of said data 
records. 
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Claim 12 (Currently Amended): The method of claim 8, wherein said clustering 
comprises analyzing data records having numerical and categorical attributes, and said 
feature vectors comprise linearly-scaled numerical attributes and cadi q utj uitiguiiuil 
fcatuii u&i ng a 1 iii^q iLpi - cj ,L n t ati u ii uf aoid data i c i u ids . 

Claim 13 (Currently Amended): A computer system for data mining and outputting a 
final clustering solution, wherein said system includes a memory for storing a database 
having a plurality of multi-dimensional data records, each having multiple, heterogeneous 
feature spaces represented by feature vectors, said system including a processor for 
executing instructions comprising: 

defining a distortion between two feature vectors as a weighted sum of distortion 
measures on components of said feature vector vectors : 

clustering said multi-dimensional data records into k-clusters using a <Wex 
pregiauiiuum* convex pmpn,mm,»p formulation; and 
selecting feature weights of said feature vectors:, 

inter-cluster dispersion a lnn p ,n „ H featMre , pa , go , 
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Claim 14 (Cancelled). 



Claim 15 (Original): The system of claim 13, wherein said instruction of said clustering 
includes an instruction for initially applying a local minima of said clustering. 

Claim 16 (Original): The system of claim 13, wherein said instruction for clustering 
includes instructions for implementing a k-means clustering algorithm. 

Claim 1 7 (Currently Amended): The system of claim W 13, wr*rein™d further 
^ ^ inStrUCtion for min ™g distortion of individual clusters meted** 
io^ding taking said data records and itenatively determining Veronal partitions until said 
2te function, between two successive iterations, is less than a specified 

threshold. 



Claim ,8 (Originai): The system of Cairn .3, wherein said instruction for cluslering 
includes instructions for analyzing word data. 

Cain, 1 9 (Original,: The system of Cairn 1 3, wherein said instruction for clustering 
includes instructions for analyzing data records having numerical and categories 

attributes. 



Claim 20 (Currently Amended): A program storage device readable by machine, tangibly 
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embodying a program of instructions executable by said machine to perform a method for 
evaluating and outputting a final clustering solution from a set of data records having 
multiple, heterogeneous feature spaces represented as feature vectors, said method 

comprising: 

defining a distortion between two feature vectors as a weighted sum of distortion 
measures on components of said feature veetor vectors : 

clustering said multi-dimensional data records into k-clusters using a "convex 
programming" convex nropramminp f^lntimi; ind 
selecting feature weights of said feature vectors:, 
wherein said selecting of feature weights are optimized by an "objective" 
objective function to produce said solution of a final clustering that simultaneously 
minimizes average intra-cluster dispersion and maximizes average inter-cluster dispersion 
along all said feature spaces. 

Claim 21 (Cancelled). 

Claim 22 (Original): The device of ciaim 20. wherein said steering includes i„ ilial | y 

applying a local minima of said clustering. 

Claim 23 (Origina,): The device of c.ai m 20, wherein said c.us.ering comprises a k- 
means clustering algorithm. : 

Claim 24 (Currcmly Amended): The device ofclaim »20, wherein said minimizing 
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distortion of individual clusters includes taking said data records and iteratively 
determining Voronoi partitions until said ^ectf^ objective function, between two 
successive iterations, is less than a specified threshold. 

Claim 25 (Original): The device of claim 20, wherein said clustering comprises analyzing 
word data, and said feature vectors comprise multiple-word frequencies of said data 
records. 

Claim 26 (Currently Amended): The device of claim 20, wherein said clustering 
comprises analyzing data records having numerical and categorical attributes, and said 
feature vectors comprise linearly-scaled numerical attributes a^reach^^egorieat 
fcMun. uAiuga 1-in u auiuuuatkm „f Juid data iu. ul d s . 
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